Action Encoding and Recognition based on Multi-Scale Spatial-Temporal Natural Action Structures
نویسندگان
چکیده
منابع مشابه
Robust Action Recognition Using Multi-Scale Spatial-Temporal Concatenations of Local Features as Natural Action Structures
Human and many other animals can detect, recognize, and classify natural actions in a very short time. How this is achieved by the visual system and how to make machines understand natural actions have been the focus of neurobiological studies and computational modeling in the last several decades. A key issue is what spatial-temporal features should be encoded and what the characteristics of t...
متن کاملAction Recognition Based on Multi-scale Oriented Neighborhood Features
The spatio-temporal (ST) position information between local features plays an important role in action recognition task. To use the information, neighborhood-based features are built for describing local ST information around ST interest points. However, traditional methods of constructing neighborhood, such as sub-ST volumetric method and nearest-neighbor-based neighborhood method, ignore the ...
متن کاملSpatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Dynamics of human body skeletons convey significant information for human action recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted parts or traversal rules, thus resulting in limited expressive power and difficulties of generalization. In this work, we propose a novel model of dynamic skeletons called SpatialTemporal Graph Convolutional Networks (ST-GCN), ...
متن کاملMulti-Scale Action Recognition in Squash Match
Algorithms for human action recognition usually observe human motion only on particular level of detail. This approach requires complex algorithms to match the complexity of motion. High recognition rates are possible, when actions are distinct and clearly visible. However, this is not the case in many practical applications. To solve this we explore the possibility of developing more general a...
متن کاملSpatio-Temporal VLAD Encoding for Human Action Recognition in Videos
Encoding is one of the key factors for building an effective video representation. In the recent works, super vector-based encoding approaches are highlighted as one of the most powerful representation generators. Vector of Locally Aggregated Descriptors (VLAD) is one of the most widely used super vector methods. However, one of the limitations of VLAD encoding is the lack of spatial informatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Vision
سال: 2014
ISSN: 1534-7362
DOI: 10.1167/14.10.840